Abstract

The quantum enveloping algebra $U_q(sl(2) \oplus sl(2))$ in the limit $q \to 0$ is proposed as a
symmetry algebra for the genetic code. In this approach the triplets of nucleotides, or codons, in the
DNA chain are classified in crystal bases, tensor products of $U_{q \to 0}(sl(2) \oplus sl(2))$ representations.
Such a construction may be compared to the classification of baryons from quark building blocks
in elementary particle physics, one of the main differences being that a crystal base provides a
natural order of the state constituents, this order being crucial in the codon. An operator ensuring
the correspondence codon/amino-acid can then be constructed out of the above algebra. It will be
called the reading operator, and is such that two codons relative to the same (resp. different)
amino-acid(s) acquire the same (resp. different) eigenvalue(s).




ENSLAPP-AL-671/97
DSF-97/37 
physics/9801027 
December 1997 



The mystery of the perfect correspondence between triplets of nucleotides, or codons, in the
deoxyribonucleic acid (DNA) sequence and the amino-acids is called the genetic code [1]. Let us
briefly recall how the DNA directs the synthesis of proteins, which constitute the most abundant
organic substances in living matter. The DNA macromolecule is made of two linear chains of
nucleotides in the famous double helix structure. Each nucleotide is characterized by one of four
elementary bases: adenine (A) and guanine (G), deriving from purine, and cytosine (C) and thymine (T),
coming from pyrimidine. The DNA is localized in the nucleus of the cell, and the transmission of the
genetic information to the cytoplasm is achieved by the messenger ribonucleic acid, or mRNA. This
operation is called transcription, the A, G, C, T bases in the DNA being respectively associated to
the U, C, G, A bases, U denoting the uracil base. Finally, a codon is an ordered sequence of three
bases (e.g. AAG, ACG, etc.), and it is a simple exercise to enumerate the 4 x 4 x 4 = 64 different
codons. Except for the three triplets UAA, UAG and UGA, each of the 61 others is related through a
ribosome to an amino-acid in the universal eukaryotic code (see Table 2). Thus the chain of
nucleotides in the mRNA, and also in the DNA, can be viewed as a sequence of triplets, each
corresponding to an amino-acid, except for the three above-mentioned ones. These last codons are
called non-sense or stop codons, and their role is to stop the biosynthesis.

One can distinguish 20 different amino-acids (see footnote 1). It follows that different codons can
be associated to the same amino-acid or, in other words, that the genetic code is degenerate.
Considering the standard eukaryotic code (see Table 2), one remarks that the codons are organized in
sextets, quadruplets, triplets, doublets and even singlets, each multiplet corresponding to a specific
amino-acid. Such a picture naturally led Hornos and Hornos [2] to look for an underlying symmetry
based on a continuous Lie group. More precisely, the authors tried to answer the following question:
is it possible to determine a Lie group $G$ carrying a 64-dimensional irreducible representation $R$
and admitting a subgroup $H$ such that the decomposition of $R$ into irreducible representations under
$H$ gives exactly the multiplets mentioned above? They proposed as starting symmetry the symplectic
group $Sp(6)$, with successive breakings down to its Cartan part $U(1) \times U(1) \times U(1)$.

Interpreting the double origin of the nucleotides, each arising either from purine or from
pyrimidine, as a $\mathbb{Z}_2$-grading, a supersymmetric extension of the above model has been
proposed [3], with the superalgebra $sl(6|1)$ as the classification (super)algebra before symmetry
breaking. A systematic search for superalgebras whose representation theory comes close to the
multiplet structure of the genetic code has also recently been carried out in ref. [4].

It is a rather different point of view that we will adopt in this letter. Indeed we will consider
the four nucleotides as basic states of the $(\tfrac12, \tfrac12)$ representation of the
$U_q(sl(2) \oplus sl(2))$ quantum enveloping algebra in the limit $q \to 0$. Then a triplet of
nucleotides will be obtained by constructing the tensor product of three such four-dimensional
representations. Actually, this approach mimics the group theoretical classification of baryons made
out of three quarks in elementary particle physics, the building blocks being here the A, C, G, T (U)
nucleotides. The main and essential difference lies in the property of a codon to be an ordered set
of three nucleotides, which is not the case for a baryon. Let us be more explicit with an example:
there are three different codons made of the A, A, U nucleotides, namely AAU, AUA and UAA, while the
proton appears as a weighted combination of the two u quarks and one d quark, that is
$|p\rangle \sim |uud\rangle + |udu\rangle + |duu\rangle$, where the spin structure is implicit.

1 They are denoted by the letters Ala (Alanine), Arg (Arginine), Asn (Asparagine), Asp (Aspartic acid), Cys
(Cysteine), Gln (Glutamine), Glu (Glutamic acid), Gly (Glycine), His (Histidine), Ile (Isoleucine), Leu (Leucine), Lys
(Lysine), Met (Methionine), Phe (Phenylalanine), Pro (Proline), Ser (Serine), Thr (Threonine), Trp (Tryptophan),
Tyr (Tyrosine), Val (Valine).

Constructing such pure states is made possible in the framework of any algebra $U_{q \to 0}(\mathcal{G})$,
with $\mathcal{G}$ being any (semi)-simple classical Lie algebra, owing to the existence of a special
basis, called crystal basis, in any (finite dimensional) representation of $\mathcal{G}$. The algebra
$\mathcal{G} = su(2) \oplus su(2) \simeq so(4)$ appears the most natural for our purpose. First of all,
it is "reasonable" to represent the four nucleotides in the fundamental representation of
$\mathcal{G}$. Moreover, the complementarity rule in the DNA-mRNA transcription suggests assigning a
quantum number with opposite values to the couples (A,T/U) and (C,G). The distinction between the
purine bases (A,G) and the pyrimidine ones (C,T/U) can be algebraically represented in an analogous
way. Thus, considering the representation $(\tfrac12, \tfrac12)$ of the group $SU(2) \times SU(2)$ and
denoting by $\pm$ the basis vectors corresponding to the eigenvalues $\pm\tfrac12$ of the $J_3$
generator in either of the two $su(2)$ algebras, we will assume the following "biological" spin
structure:

$$
\begin{array}{ccc}
 & su(2)_H & \\
C \equiv (+,+) & \longleftrightarrow & U \equiv (-,+) \\
su(2)_V \updownarrow & & \updownarrow su(2)_V \\
G \equiv (+,-) & \longleftrightarrow & A \equiv (-,-) \\
 & su(2)_H &
\end{array}
\qquad (1)
$$

the subscripts H (:= horizontal) and V (:= vertical) being just added to specify the group actions.

Now, let us turn our attention towards the representations of $U_{q \to 0}(\mathcal{G})$ and more
specifically to their crystal bases. In statistical mechanics, the $q \to 0$ limit of a deformed
(quantum) algebra can be interpreted as the absolute zero temperature in a lattice model. Introducing
in $U_{q \to 0}(\mathcal{G})$ the operators $\tilde e_i$ and $\tilde f_i$
($i = 1, \dots, \mathrm{rank}\,\mathcal{G}$), obtained after modification of the simple root vectors
$e_i$ and $f_i$ of $U_q(\mathcal{G})$, a particular kind of basis in a $U_q(\mathcal{G})$-module can be
defined. Such a basis is called a crystal basis and has the property of undergoing in an especially
simple way the action of the $\tilde e_i$ and $\tilde f_i$ operators: as an example, for any couple of
vectors $u, v$ in the crystal basis $\mathcal{B}$, one gets $u = \tilde e_i v$ if and only if
$v = \tilde f_i u$. Note that nothing prevents us from considering the four states C, U, G, A defined
in (1) as constituting a crystal basis for the $(\tfrac12, \tfrac12)$ module of
$U_{q \to 0}(sl(2) \oplus sl(2))$. More interesting for our purpose is the crystal basis in the tensor
product of two representations. Then the following theorem holds [5]:

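Let $\mathcal{B}_1$ and $\mathcal{B}_2$ be the crystal bases of the $U_{q \to 0}(\mathcal{G})$-modules
$M_1$ and $M_2$ respectively. Then for $u \in \mathcal{B}_1$ and $v \in \mathcal{B}_2$, we have:

$$
\tilde f_i (u \otimes v) =
\begin{cases}
\tilde f_i u \otimes v & \exists\, n \ge 1 \text{ such that } \tilde f_i^{\,n} u \neq 0 \text{ and } \tilde e_i v = 0 \\
u \otimes \tilde f_i v & \text{otherwise}
\end{cases}
\qquad (2)
$$

$$
\tilde e_i (u \otimes v) =
\begin{cases}
u \otimes \tilde e_i v & \exists\, n \ge 1 \text{ such that } \tilde e_i^{\,n} v \neq 0 \text{ and } \tilde f_i u = 0 \\
\tilde e_i u \otimes v & \text{otherwise}
\end{cases}
\qquad (3)
$$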

To represent a codon, we will have to perform the tensor product of three $(\tfrac12, \tfrac12)$
representations of $U_{q \to 0}(sl(2) \oplus sl(2))$. However, it is well known, and easy to check from
Tables 1 and 2, that in a multiplet of codons relative to a specific amino-acid the first two bases
of a codon are "relatively stable", the degeneracy being mainly generated by the third nucleotide.
For that reason, we will prefer to examine a codon as a 2+1 state instead of a simple triplet. So,
let us consider in detail the first tensor product:

$$ (\tfrac12, \tfrac12) \otimes (\tfrac12, \tfrac12) = (1,1) \oplus (1,0) \oplus (0,1) \oplus (0,0) \qquad (4) $$

where inside the parentheses $j = 0, \tfrac12, 1$ is put in place of the $2j + 1 = 1, 2, 3$
dimensional $SU(2)$ representation respectively. We get, using Theorem 1, the following tableau:

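$$
\begin{array}{lll}
 & \to\ su(2)_H & \\[4pt]
\downarrow\ su(2)_V &
(0,0):\ \begin{pmatrix} CA \end{pmatrix} &
(1,0):\ \begin{pmatrix} CG & UG & UA \end{pmatrix} \\[8pt]
 &
(0,1):\ \begin{pmatrix} CU \\ GU \\ GA \end{pmatrix} &
(1,1):\ \begin{pmatrix} CC & UC & UU \\ GC & AC & AU \\ GG & AG & AA \end{pmatrix}
\end{array}
$$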
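The tableau can be reproduced mechanically. The following is a minimal Python sketch, assuming the single-nucleotide lowering maps C → U, G → A under su(2)_H and C → G, U → A under su(2)_V read off diagram (1); it applies rules (2) and (3) to the sixteen dinucleotides. The names `LOWER`, `RAISE`, `f_pair`, `e_pair` and `multiplet` are illustrative and not taken from the source, and `None` stands for the zero vector.

```python
# Single-nucleotide crystal operators read off diagram (1); None stands for 0.
LOWER = {"H": {"C": "U", "G": "A"}, "V": {"C": "G", "U": "A"}}
RAISE = {a: {low: up for up, low in table.items()} for a, table in LOWER.items()}


def f_pair(u, v, a):
    """Eq. (2): lower u if f(u) != 0 and e(v) == 0, otherwise lower v."""
    if u in LOWER[a] and v not in RAISE[a]:
        return LOWER[a][u] + v
    return u + LOWER[a][v] if v in LOWER[a] else None


def e_pair(u, v, a):
    """Eq. (3): raise v if e(v) != 0 and f(u) == 0, otherwise raise u."""
    if v in RAISE[a] and u not in LOWER[a]:
        return u + RAISE[a][v]
    return RAISE[a][u] + v if u in RAISE[a] else None


def multiplet(top):
    """States reached from a highest-weight state: rows follow f_V, columns f_H."""
    rows, state = [], top
    while state is not None:
        row, x = [], state
        while x is not None:
            row.append(x)
            x = f_pair(x[0], x[1], "H")
        rows.append(row)
        state = f_pair(state[0], state[1], "V")
    return rows


# Highest-weight dinucleotides are annihilated by both raising operators.
tops = [u + v for u in "CUGA" for v in "CUGA"
        if e_pair(u, v, "H") is None and e_pair(u, v, "V") is None]

for top in tops:
    rows = multiplet(top)
    spins = ((len(rows[0]) - 1) / 2, (len(rows) - 1) / 2)  # (J_H, J_V)
    print(spins, rows)
```

Under these assumptions the highest-weight states are CC, CU, CG and CA, heading the (1,1), (0,1), (1,0) and (0,0) multiplets of decomposition (4).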

From Tables 1 and 2, the dinucleotide states formed by the first two nucleotides in a codon can be
put in correspondence with quadruplets, doublets or singlets of codons relative to an amino-acid.
Note that the sextets (resp. triplets) are viewed as the sum of a quadruplet and a doublet (resp.
a doublet and a singlet). The dinucleotide states associated to the quadruplets (as well as those
included in the sextets) of codons satisfy:

$$ J^d_{H,3} > 0 \quad \text{or} \quad J^d_{H,3} = 0,\ J^d_{V,3} \ge 0,\ J^d_V \neq 0 . \qquad (5) $$

where $J^d_{H,3}$ and $J^d_{V,3}$ are the third components of the spin generators of the dinucleotide
states. The dinucleotide states associated to the doublets (as well as those included in the triplets)
and possibly to the singlets of codons are such that:

$$ J^d_{H,3} < 0 \quad \text{or} \quad J^d_{H,3} = 0,\ \left( J^d_{V,3} < 0 \ \text{or} \ J^d_V = 0 \right) . \qquad (6) $$
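Conditions (5) and (6) can be checked against the code table. The sketch below is illustrative only: it assumes the nucleotide signs of diagram (1), computes spins of a crystal-basis word by the usual bracket cancellation (each lowered letter cancels the nearest uncancelled raised letter to its left), and encodes Table 2 as the string `AMINO` in one-letter amino-acid symbols ('*' for Ter) with bases ordered U, C, A, G. That encoding and the names `SIGN`, `spin` and `is_quadruplet_type` are assumptions of this sketch, not part of the source.

```python
from fractions import Fraction

# Sign of the J_3 eigenvalue of each nucleotide under su(2)_H and su(2)_V, diagram (1).
SIGN = {"H": {"C": 1, "U": -1, "G": 1, "A": -1},
        "V": {"C": 1, "U": 1, "G": -1, "A": -1}}


def spin(word, a):
    """Return (J, J_3) of a crystal-basis word under su(2)_a by bracket cancellation."""
    plus = minus = 0  # letters left unpaired so far
    for x in word:
        if SIGN[a][x] > 0:
            plus += 1
        elif plus:
            plus -= 1   # a '-' cancels the nearest unpaired '+' on its left
        else:
            minus += 1
    return Fraction(plus + minus, 2), Fraction(plus - minus, 2)


def is_quadruplet_type(dinucleotide):
    """Condition (5) on the first two nucleotides of a codon."""
    (_, jh3), (jv, jv3) = spin(dinucleotide, "H"), spin(dinucleotide, "V")
    return jh3 > 0 or (jh3 == 0 and jv3 >= 0 and jv != 0)


# Table 2 in one-letter symbols, codons ordered with bases U, C, A, G (assumed encoding).
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {x + y + z: AMINO[16 * i + 4 * j + k]
        for i, x in enumerate(BASES)
        for j, y in enumerate(BASES)
        for k, z in enumerate(BASES)}

# Dinucleotides whose four codons all give the same amino-acid.
fourfold = {x + y for x in BASES for y in BASES
            if len({CODE[x + y + z] for z in BASES}) == 1}
predicted = {x + y for x in BASES for y in BASES if is_quadruplet_type(x + y)}

print(sorted(predicted))
print(predicted == fourfold)
```

With these inputs the eight dinucleotides selected by condition (5) coincide with the eight fourfold-degenerate families of Table 2; the remaining eight satisfy condition (6).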
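On the other hand, if we consider the three-fold tensor product, the content into irreducible
representations of $U_{q \to 0}(sl(2) \oplus sl(2))$ is given by:

$$
(\tfrac12,\tfrac12) \otimes (\tfrac12,\tfrac12) \otimes (\tfrac12,\tfrac12)
= (\tfrac32,\tfrac32) \oplus 2\,(\tfrac32,\tfrac12) \oplus 2\,(\tfrac12,\tfrac32) \oplus 4\,(\tfrac12,\tfrac12)
\qquad (7)
$$

The structure of the irreducible representations of the r.h.s. of Eq. (7) is:

$$
(\tfrac32,\tfrac32) \equiv
\begin{pmatrix}
CCC & UCC & UUC & UUU \\
GCC & ACC & AUC & AUU \\
GGC & AGC & AAC & AAU \\
GGG & AGG & AAG & AAA
\end{pmatrix}
$$

$$
(\tfrac32,\tfrac12)^1 \equiv
\begin{pmatrix}
CCG & UCG & UUG & UUA \\
GCG & ACG & AUG & AUA
\end{pmatrix}
\qquad
(\tfrac32,\tfrac12)^2 \equiv
\begin{pmatrix}
CGC & UGC & UAC & UAU \\
CGG & UGG & UAG & UAA
\end{pmatrix}
$$

$$
(\tfrac12,\tfrac32)^1 \equiv
\begin{pmatrix}
CCU & UCU \\
GCU & ACU \\
GGU & AGU \\
GGA & AGA
\end{pmatrix}
\qquad
(\tfrac12,\tfrac32)^2 \equiv
\begin{pmatrix}
CUC & CUU \\
GUC & GUU \\
GAC & GAU \\
GAG & GAA
\end{pmatrix}
$$

$$
(\tfrac12,\tfrac12)^1 \equiv
\begin{pmatrix}
CCA & UCA \\
GCA & ACA
\end{pmatrix}
\qquad
(\tfrac12,\tfrac12)^2 \equiv
\begin{pmatrix}
CUG & CUA \\
GUG & GUA
\end{pmatrix}
$$

$$
(\tfrac12,\tfrac12)^3 \equiv
\begin{pmatrix}
CGU & UGU \\
CGA & UGA
\end{pmatrix}
\qquad
(\tfrac12,\tfrac12)^4 \equiv
\begin{pmatrix}
CAC & CAU \\
CAG & CAA
\end{pmatrix}
$$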
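The nine multiplets above can be regenerated by sending every codon to its highest-weight state. The snippet below is a sketch under stated assumptions: it uses the raising maps U → C, A → G (su(2)_H) and G → C, A → U (su(2)_V) implied by diagram (1), and the standard bracket form of the crystal tensor-product rule, in which the raising operator flips the rightmost lowered letter left uncancelled. The function names are hypothetical.

```python
from collections import defaultdict
from itertools import product

# Raising maps on single nucleotides, inverse of the lowering maps of diagram (1).
RAISE = {"H": {"U": "C", "A": "G"}, "V": {"G": "C", "A": "U"}}


def raise_once(word, a):
    """Apply the crystal raising operator of su(2)_a to a word, or return None."""
    open_plus, unpaired_minus = [], []
    for i, x in enumerate(word):
        if x in RAISE[a]:              # letter with J_3 = -1/2
            if open_plus:
                open_plus.pop()        # cancelled by a '+' on its left
            else:
                unpaired_minus.append(i)
        else:                          # letter with J_3 = +1/2
            open_plus.append(i)
    if not unpaired_minus:
        return None
    i = unpaired_minus[-1]             # rightmost uncancelled '-'
    return word[:i] + RAISE[a][word[i]] + word[i + 1:]


def highest_weight(word):
    """Raise under su(2)_H, then su(2)_V, until both operators give zero."""
    for a in ("H", "V"):
        while (up := raise_once(word, a)) is not None:
            word = up
    return word


multiplets = defaultdict(list)
for codon in map("".join, product("CUGA", repeat=3)):
    multiplets[highest_weight(codon)].append(codon)

for top, members in multiplets.items():
    print(top, len(members), members)
```

The dictionary keys are the nine highest-weight codons (CCC, CCG, CGC, CCU, CUC, CCA, CUG, CGU, CAC), and the sizes 16, 8, 8, 8, 8, 4, 4, 4, 4 add up to the 64 codons of Eq. (7).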



As expected from the formulae above, our model cannot gather codons associated to one particular
amino-acid in the same irreducible multiplet. However, it is possible to construct an operator
$\mathcal{R}$ out of the algebra $U_{q \to 0}(sl(2) \oplus sl(2))$, acting on the codons, that will
describe the genetic code in the following way:

Two codons have the same eigenvalue under $\mathcal{R}$ if and only if they are associated to the same
amino-acid.

This operator will be called the reading operator. It has the following form:

$$
\begin{aligned}
\mathcal{R} ={}& \tfrac43 c_1\, C_H + \tfrac43 c_2\, C_V - 4 c_1\, \mathcal{P}_1\, J_{H,3} - 4 c_2\, \mathcal{P}_2\, J_{V,3}
 + (\mathcal{P}_3\, c_3 + \mathcal{P}_4\, c_4)\, J_{V,3} \\
& + \mathcal{P}_5\, c_5 \left( \tfrac12 - J^{(3)}_{V,3} \right)
 + (\mathcal{P}_6\, c_6 + \mathcal{P}'_6\, c'_6) \left( \tfrac12 - J^{(3)}_{V,3} \right) J^{(3)}_{H,3} .
\end{aligned}
\qquad (8)
$$

In Eq. (8), the operators $J_{H,3}$ and $J_{V,3}$ are the third components of the total spin
generators of the algebra $U_{q \to 0}(sl(2) \oplus sl(2))$, while $J^{(3)}_{H,3}$ and $J^{(3)}_{V,3}$
are the third components corresponding to the third nucleotide of a codon. Of course, these last two
operators can be replaced by $J^{(3)}_{\alpha,3} = J_{\alpha,3} - J^d_{\alpha,3}$ ($\alpha = H, V$).
The operator $C_\alpha$ ($\alpha = H, V$) is a "Casimir" operator of $U_{q \to 0}(sl(2)_\alpha)$ in the
crystal basis. It is characterized by the property that it commutes with $J_{\pm,H}$, $J_{\pm,V}$ and
$J_{H,3}$, $J_{V,3}$ (where $J_{\pm,H}$, $J_{\pm,V}$ are the generators with a well-defined behaviour
for $q \to 0$), and its eigenvalue on any basis vector of an irreducible representation of highest
weight $J$ is $J(J+1)$, i.e. the same as that of the undeformed standard second degree Casimir
operator of $sl(2)$. Its explicit expression is

$$
C = (J_3)^2 + \frac12 \sum_{n \in \mathbb{Z}_+} \sum_{k=0}^{n} (J_-)^{n-k} (J_+)^{n} (J_-)^{k} . \qquad (9)
$$

Note that for $sl(2)_{q \to 0}$ the "Casimir" operator is an infinite series of powers of $J_-$ and
$J_+$. However, in any finite irreducible representation only a finite number of terms gives a
non-vanishing contribution.
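$\mathcal{P}_i$ ($i = 1, \dots, 5$) are projectors given by the following expressions:

$$
\begin{aligned}
\mathcal{P}_1 &= J^d_{H+} J^d_{H-} , \\
\mathcal{P}_2 &= J^d_{V+} J^d_{V-} , \\
\mathcal{P}_3 &= J^d_{H-} J^d_{H+} \left( 1 - J^d_{H+} J^d_{H-}\, J^d_{V+} J^d_{V-} \right)
 + \left( 1 - J^d_{H+} J^d_{H-} \right) \left( 1 - J^d_{H-} J^d_{H+} \right) \left( 1 - J^d_{V+} J^d_{V-} \right) , \\
\mathcal{P}_4 &= \left( J^d_{H-} J^d_{H+} \right) \left[ J^d_{H+} J^d_{H-} \left( 1 - J^d_{V+} J^d_{V-} \right)
 + \left( J^d_{V+} J^d_{V-} \right) \left( J^d_{V-} J^d_{V+} \right) \left( 1 - J^d_{H+} J^d_{H-} \right) \right] , \\
\mathcal{P}_5 &= \left( J^d_{H-} J^d_{H+} \right) \left( J^d_{H+} J^d_{H-} \right) \left( J^d_{V-} J^d_{V+} \right) \left( 1 - J^d_{V+} J^d_{V-} \right) .
\end{aligned}
\qquad (10)
$$

The projectors $\mathcal{P}_6$, $\mathcal{P}'_6$ appear only for the eukaryotic code. Their expressions
are given by:

$$
\begin{aligned}
\mathcal{P}_6 &= \left( J^d_{H-} J^d_{H+} \right) \left( J^d_{V-} J^d_{V+} \right) \left( 1 - J^d_{H+} J^d_{H-} \right) \left( J^d_{V+} J^d_{V-} \right) , \\
\mathcal{P}'_6 &= \left( J^d_{H-} J^d_{H+} \right) \left( 1 - J^d_{V-} J^d_{V+} \right) \left( J^d_{H+} J^d_{H-} \right) \left( 1 - J^d_{V+} J^d_{V-} \right) .
\end{aligned}
\qquad (11)
$$

On the dinucleotide states of Table 4, $\mathcal{P}_3$ takes the value 1 on the eight dinucleotides
satisfying condition (6), $\mathcal{P}_4$ on AG, UG and AU, $\mathcal{P}_5$ on AG, $\mathcal{P}_6$ on AU
and $\mathcal{P}'_6$ on UG, and each vanishes elsewhere.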

The terms in $c_1$ and $c_2$ are responsible for the structure in quadruplets (given essentially by
the dinucleotide content). The terms in $c_3$ give rise to the splitting of the quadruplets into
doublets. The terms in $c_4$ and $c_5$ lead to the sextets. Finally, the terms in $c_6$ and $c'_6$,
which appear only in the eukaryotic code, are responsible for the singlet and triplet structure.

Now, using the values of the quantum numbers $J_H$, $J_V$, $J_{H,3}$, $J_{V,3}$ and
$J^d_{\alpha,\pm} J^d_{\alpha,\mp}$ ($\alpha = H, V$) of the codons given in Tables 3 and 4, one can
compute the action of the reading operator $\mathcal{R}$ on each of the 64 codons.
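As an illustration of how these quantum numbers enter, here is a worked evaluation for the codon CCC. It is a sketch that takes the prefactor of the two Casimir terms in Eq. (8) to be 4/3 and uses $J_H = J_V = J_{H,3} = J_{V,3} = \tfrac32$ (Table 3) together with $J^d_{H+} J^d_{H-} = J^d_{V+} J^d_{V-} = 1$ on the dinucleotide CC (Table 4); the terms in $c_3, \dots, c'_6$ are assumed not to contribute on this quadruplet-type dinucleotide.

```latex
\begin{align*}
\mathcal{R}\,|CCC\rangle
 &= \Big[ \tfrac43 c_1 \cdot \tfrac32\big(\tfrac32 + 1\big)
        + \tfrac43 c_2 \cdot \tfrac32\big(\tfrac32 + 1\big)
        - 4 c_1 \cdot \tfrac32 - 4 c_2 \cdot \tfrac32 \Big] |CCC\rangle \\
 &= \big[ 5 c_1 + 5 c_2 - 6 c_1 - 6 c_2 \big] |CCC\rangle
  = (-c_1 - c_2)\, |CCC\rangle .
\end{align*}
```

The same value follows for CCA, where $J_H = J_V = J_{H,3} = J_{V,3} = \tfrac12$: $c_1 + c_2 - 2c_1 - 2c_2 = -c_1 - c_2$, so both codons of the CCX family carry one eigenvalue.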

Although the eukaryotic code (EC) seems to be a universal genetic code, it appears in some
way as an advanced form of the vertebrate mitochondrial code (VMC). Indeed there are very few
differences between the two codes. The codons in the VMC are organized into 2 sextets, 6 quadruplets
and 14 doublets. When evolving from the VMC to the EC, one doublet and one quartet merge
to form a sextet, while two other doublets split into four singlets, two of them gluing with
existing doublets to form two triplets. The final result for the EC is 3 sextets, 5 quadruplets, 9
doublets, 2 triplets and 2 singlets. Hence, it appears natural to start by calculating $\mathcal{R}$
for the vertebrate mitochondrial code.

a) Vertebrate Mitochondrial Code:

One finds the following eigenvalues of the reading operator $\mathcal{R}$ in the case of the vertebrate
mitochondrial code, identifying each amino-acid with its corresponding codons (Ser corresponds to the
codons UCX (X = C, U, G, A) while Ser' corresponds to the codons AGC/AGU; similarly Leu is related to
the quartet CUX and Leu' to the doublet UUG/UUA; finally Arg is given by the quartet CGX and Ter'
by the doublet AGG/AGA):

$$
\begin{array}{ll}
\mathrm{Pro} = -c_1 - c_2 & \mathrm{Thr} = 3c_1 + 3c_2 \\
\mathrm{Ala} = -c_1 + 3c_2 & \mathrm{Ser} = 3c_1 - c_2 \\
\mathrm{Asp} = c_1 + 5c_2 - \tfrac12 c_3 & \mathrm{Glu} = c_1 + 5c_2 - \tfrac32 c_3 \\
\mathrm{Tyr} = 5c_1 + c_2 + \tfrac12 c_3 & \mathrm{Ter} = 5c_1 + c_2 - \tfrac12 c_3 \\
\mathrm{Asn} = 5c_1 + 5c_2 - \tfrac12 c_3 & \mathrm{Lys} = 5c_1 + 5c_2 - \tfrac32 c_3 \\
\mathrm{His} = c_1 + c_2 + \tfrac12 c_3 & \mathrm{Gln} = c_1 + c_2 - \tfrac12 c_3 \\
\mathrm{Arg} = -c_1 + c_2 & \mathrm{Gly} = -c_1 + 5c_2 \\
\mathrm{Cys} = 3c_1 + c_2 + \tfrac12 c_3 + \tfrac12 c_4 & \mathrm{Trp} = 3c_1 + c_2 - \tfrac12 c_3 - \tfrac12 c_4 \\
\mathrm{Ser'} = 3c_1 + 5c_2 - \tfrac12 c_3 - \tfrac12 c_4 & \mathrm{Ter'} = 3c_1 + 5c_2 - \tfrac32 c_3 - \tfrac32 c_4 + c_5 \\
\mathrm{Val} = c_1 + 3c_2 & \mathrm{Leu} = c_1 - c_2 \\
\mathrm{Phe} = 5c_1 - c_2 + \tfrac32 c_3 & \mathrm{Leu'} = 5c_1 - c_2 + \tfrac12 c_3 \\
\mathrm{Ile} = 5c_1 + 3c_2 + \tfrac12 c_3 + \tfrac12 c_4 & \mathrm{Met} = 5c_1 + 3c_2 - \tfrac12 c_3 - \tfrac12 c_4
\end{array}
\qquad (12)
$$

The parameters $c_3$, $c_4$ are fixed by the following requirements. The condition Leu = Leu' fixes
the coefficient $c_3$, and one obtains $c_3 = -8c_1$. At this point, one is led to add a correcting
term in $\mathcal{R}$, since the symmetry of the genetic code implies Ile = Val and Cys = Arg as soon
as Leu = Leu', while Ser' is not equal to Ser. Hence the projector $\mathcal{P}_4$ has a non-vanishing
value on the AG, UG and AU dinucleotides. The condition Ser' = Ser then implies
$c_4 = 8c_1 + 12c_2$. At this point, Ile and Val on the one hand, and Cys and Arg on the other hand,
become different as required. Finally, the parameter $c_5$ is fixed for the VMC by requiring that
Ter' = Ter. One finds $c_5 = 6c_1 + 14c_2$. The requirement that $\mathcal{R}$ provide different
eigenvalues to codons associated to different amino-acids implies the non-vanishing of $c_1$ and
$c_2$. This leads, after a rescaling, to the following expression of the reading operator for the
vertebrate mitochondrial code (where $c = c_1/c_2$):

$$
\begin{aligned}
\mathcal{R}_{VMC}(c) ={}& \tfrac43 c\, C_H + \tfrac43 C_V - 4c\, \mathcal{P}_1\, J_{H,3} - 4\, \mathcal{P}_2\, J_{V,3}
 + \left( -8c\, \mathcal{P}_3 + (8c + 12)\, \mathcal{P}_4 \right) J_{V,3} \\
& + (6c + 14)\, \mathcal{P}_5 \left( \tfrac12 - J^{(3)}_{V,3} \right) .
\end{aligned}
\qquad (13)
$$

and therefore to the following values for the amino-acids:



a.a.   value of the codon     a.a.   value of the codon     a.a.   value of the codon
Ala    -c + 3                 Gly    -c + 5                 Pro    -c - 1
Arg    -c + 1                 His    -3c + 1                Ser    3c - 1
Asn    9c + 5                 Ile    5c + 9                 Thr    3c + 3
Asp    5c + 5                 Leu    c - 1                  Trp    3c - 5
Cys    3c + 7                 Lys    17c + 5                Tyr    c + 1
Gln    5c + 1                 Met    5c - 3                 Val    c + 3
Glu    13c + 5                Phe    -7c - 1                Ter    9c + 1
                                                                               (14)

The vertebrate mitochondrial code
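The values above can be cross-checked numerically. The following is a minimal sketch, not the authors' implementation: it assumes the 4/3 prefactors of the Casimir terms, computes spins by bracket cancellation from the signs of diagram (1), and codes the projectors through their 0/1 values on the dinucleotides (a form chosen here to agree with Table 4 and Eq. (12), not transcribed from Eq. (10)). The string `VMC` encodes Table 1 in one-letter symbols ('*' for Ter) with bases ordered U, C, A, G, and the test value c = 7 is an arbitrary choice outside the forbidden set.

```python
from fractions import Fraction

SIGN = {"H": {"C": 1, "U": -1, "G": 1, "A": -1},
        "V": {"C": 1, "U": 1, "G": -1, "A": -1}}


def spin(word, a):
    """(J, J_3) of a crystal-basis word under su(2)_a by bracket cancellation."""
    plus = minus = 0
    for x in word:
        if SIGN[a][x] > 0:
            plus += 1
        elif plus:
            plus -= 1
        else:
            minus += 1
    return Fraction(plus + minus, 2), Fraction(plus - minus, 2)


def reading_vmc(codon, c):
    """Eigenvalue of the rescaled reading operator of Eq. (13) on one codon."""
    (jh, mh), (jv, mv) = spin(codon, "H"), spin(codon, "V")
    (dh, dmh), (dv, dmv) = spin(codon[:2], "H"), spin(codon[:2], "V")
    # 0/1 values of J+J- and J-J+ on the dinucleotide (entries of Table 4).
    hp, hm = int(dh + dmh > 0), int(dh - dmh > 0)
    vp, vm = int(dv + dmv > 0), int(dv - dmv > 0)
    p1, p2 = hp, vp
    p3 = hm * (1 - hp * vp) + (1 - hp) * (1 - hm) * (1 - vp)
    p4 = hm * (hp * (1 - vp) + vp * vm * (1 - hp))
    p5 = hp * hm * vm * (1 - vp)
    m3 = Fraction(SIGN["V"][codon[2]], 2)      # J_{V,3} of the third nucleotide
    return (Fraction(4, 3) * c * jh * (jh + 1) + Fraction(4, 3) * jv * (jv + 1)
            - 4 * c * p1 * mh - 4 * p2 * mv
            + (-8 * c * p3 + (8 * c + 12) * p4) * mv
            + (6 * c + 14) * p5 * (Fraction(1, 2) - m3))


# Table 1 in one-letter symbols, bases ordered U, C, A, G (assumed encoding).
BASES = "UCAG"
VMC = "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG"
code = {x + y + z: VMC[16 * i + 4 * j + k]
        for i, x in enumerate(BASES)
        for j, y in enumerate(BASES)
        for k, z in enumerate(BASES)}

c = Fraction(7)
values = {codon: reading_vmc(codon, c) for codon in code}
by_value = {}
for codon, value in values.items():
    by_value.setdefault(value, set()).add(code[codon])

print(len(by_value), all(len(s) == 1 for s in by_value.values()))
```

Under these assumptions the 64 codons fall into 21 distinct eigenvalues, one per amino-acid or Ter; for instance CCC gives -c - 1 = -8 and both AGA and UAA give 9c + 1 = 64.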
We remark that the reading operator $\mathcal{R}_{VMC}(c)$ can be used for any real value of $c$,
except those conferring the same eigenvalue to codons relative to two different amino-acids. These
forbidden values are the following: -7, -5, -4, -3, -5/2, -2, -5/3, -3/2, -4/3, -1, -5/6, -4/5,
-3/4, -5/7, -2/3, -3/5, -1/2, -3/7, -2/5, -3/8, -1/3, -3/10, -2/7, -1/4, -2/9, -1/5, -1/6, -1/7,
-1/8, -1/9, 0, 1/7, 1/6, 1/5, 1/4, 1/3, 2/5, 1/2, 2/3, 1, 4/3, 3/2, 2, 5/2, 3, 4, 5.
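The forbidden values follow directly from table (14): each eigenvalue is a line a c + b, and two amino-acids collide where their lines cross. A short sketch of that computation is given below; the dictionary `VMC_VALUES` simply transcribes table (14) as (a, b) pairs, and the function name `forbidden` is illustrative.

```python
from fractions import Fraction
from itertools import combinations

# (a, b) for each eigenvalue a*c + b of table (14).
VMC_VALUES = {
    "Ala": (-1, 3), "Gly": (-1, 5), "Pro": (-1, -1),
    "Arg": (-1, 1), "His": (-3, 1), "Ser": (3, -1),
    "Asn": (9, 5), "Ile": (5, 9), "Thr": (3, 3),
    "Asp": (5, 5), "Leu": (1, -1), "Trp": (3, -5),
    "Cys": (3, 7), "Lys": (17, 5), "Tyr": (1, 1),
    "Gln": (5, 1), "Met": (5, -3), "Val": (1, 3),
    "Glu": (13, 5), "Phe": (-7, -1), "Ter": (9, 1),
}


def forbidden(table):
    """Sorted values of c at which two different entries share an eigenvalue."""
    crossings = set()
    for (a1, b1), (a2, b2) in combinations(table.values(), 2):
        if a1 != a2:                       # parallel lines never meet
            crossings.add(Fraction(b2 - b1, a1 - a2))
    return sorted(crossings)


forbidden_vmc = forbidden(VMC_VALUES)
print(", ".join(str(v) for v in forbidden_vmc))
```

Any real c outside the printed set, for example c = 7, gives 21 pairwise distinct eigenvalues.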

b) The Eukaryotic Code:

In the case of the eukaryotic code, most of the eigenvalues of the reading operator are the same. The
difference between the VMC and the EC comes i) from the doublets Met and Trp, which split into the
singlets Met (AUG) + Ile'' (AUA) and Trp (UGG) + Ter'' (UGA), and ii) from the doublet Ter', which
merges with the quartet Arg to form a sextet. The eigenvalues for the new structures are the following:

$$
\begin{aligned}
\mathrm{Ter''} &= 3c_1 + c_2 - \tfrac12 c_3 - \tfrac12 c_4 - c'_6 \\
\mathrm{Trp} &= 3c_1 + c_2 - \tfrac12 c_3 - \tfrac12 c_4 + c'_6 \\
\mathrm{Ile''} &= 5c_1 + 3c_2 - \tfrac12 c_3 - \tfrac12 c_4 - c_6 \\
\mathrm{Met} &= 5c_1 + 3c_2 - \tfrac12 c_3 - \tfrac12 c_4 + c_6
\end{aligned}
\qquad (15)
$$

The parameters $c_3$, $c_4$ are given as in the VMC. The parameter $c_5$ is now fixed by the condition
Ter' = Arg. One obtains $c_5 = -4c_1 + 14c_2$. The parameters $c_6$ and $c'_6$ describe the splitting
of the doublets Met and Trp into singlets: they are determined by requiring Ile'' = Ile and
Ter'' = Ter. It follows that $c_6 = -12c_2$ and $c'_6 = -6c_1 - 6c_2$. Hence the reading operator for
the eukaryotic code reads:

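$$
\begin{aligned}
\mathcal{R}_{EC}(c) ={}& \tfrac43 c\, C_H + \tfrac43 C_V - 4c\, \mathcal{P}_1\, J_{H,3} - 4\, \mathcal{P}_2\, J_{V,3}
 + \left( -8c\, \mathcal{P}_3 + (8c + 12)\, \mathcal{P}_4 \right) J_{V,3} \\
& + (-4c + 14)\, \mathcal{P}_5 \left( \tfrac12 - J^{(3)}_{V,3} \right)
 - 6 \left( 2\, \mathcal{P}_6 + (c + 1)\, \mathcal{P}'_6 \right) \left( \tfrac12 - J^{(3)}_{V,3} \right) J^{(3)}_{H,3} .
\end{aligned}
\qquad (16)
$$

where, as in case (a), we have performed a rescaling and $c = c_1/c_2$. This leads to the following
values for the amino-acids: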



a.a.   value of the codon     a.a.   value of the codon     a.a.   value of the codon
Ala    -c + 3                 Gly    -c + 5                 Pro    -c - 1
Arg    -c + 1                 His    -3c + 1                Ser    3c - 1
Asn    9c + 5                 Ile    5c + 9                 Thr    3c + 3
Asp    5c + 5                 Leu    c - 1                  Trp    -3c - 11
Cys    3c + 7                 Lys    17c + 5                Tyr    c + 1
Gln    5c + 1                 Met    5c - 15                Val    c + 3
Glu    13c + 5                Phe    -7c - 1                Ter    9c + 1
                                                                               (17)

The eukaryotic code

As in case (a), we have to rule out the values of the parameter $c$ such that different amino-acids
get the same eigenvalue under $\mathcal{R}_{EC}(c)$. The forbidden values now are the following: -8, -7,
-6, -5, -4, -7/2, -3, -5/2, -7/3, -2, -5/3, -3/2, -4/3, -1, -5/6, -4/5, -3/4, -2/3, -3/5, -1/2,
-3/7, -2/5, -3/8, -1/3, -3/10, -2/7, -1/4, -2/9, -1/5, -1/6, -1/7, -1/8, -1/9, 0, 1/7, 1/5, 1/4,
1/3, 2/5, 1/2, 2/3, 1, 7/6, 3/2, 2, 7/3, 5/2, 8/3, 3, 10/3, 7/2, 4, 9/2, 7, 9, 11.

The simple model that we propose obviously needs to be developed. First, from the symmetry
point of view, it would be nice to understand, or at least to include naturally in our approach, the
existence of sextets. Concerning the group structure, we have chosen $U_{q \to 0}(\mathcal{G})$ with a
minimal group $G = SU(2) \times SU(2)$, keeping in mind a physical interpretation. Of course, a larger
symmetry might be of some help. As a second step, it will be reasonable to consider a more realistic
model including interactions among bases. We hope to be able soon to apply our approach on the one
hand to mutations and on the other hand to the fundamental problem of the genome sequence.

Let us end this note with the following general remark. There are intense efforts these days to
develop an interface between physics and biology. Different approaches are considered, among them
the study of the DNA as an ideal polymer in the framework of statistical physics. But, to our
knowledge, no direct connection between biology and elementary particle physics has shown up so far.
We hope that our proposal will arouse the interest of elementary particle physicists in biology.

CCC Pro    UCC Ser    GCC Ala    ACC Thr
CCU Pro    UCU Ser    GCU Ala    ACU Thr
CCG Pro    UCG Ser    GCG Ala    ACG Thr
CCA Pro    UCA Ser    GCA Ala    ACA Thr
CUC Leu    UUC Phe    GUC Val    AUC Ile
CUU Leu    UUU Phe    GUU Val    AUU Ile
CUG Leu    UUG Leu    GUG Val    AUG Met
CUA Leu    UUA Leu    GUA Val    AUA Met
CGC Arg    UGC Cys    GGC Gly    AGC Ser
CGU Arg    UGU Cys    GGU Gly    AGU Ser
CGG Arg    UGG Trp    GGG Gly    AGG Ter
CGA Arg    UGA Trp    GGA Gly    AGA Ter
CAC His    UAC Tyr    GAC Asp    AAC Asn
CAU His    UAU Tyr    GAU Asp    AAU Asn
CAG Gln    UAG Ter    GAG Glu    AAG Lys
CAA Gln    UAA Ter    GAA Glu    AAA Lys

Table 1: The vertebrate mitochondrial code.



CCC Pro    UCC Ser    GCC Ala    ACC Thr
CCU Pro    UCU Ser    GCU Ala    ACU Thr
CCG Pro    UCG Ser    GCG Ala    ACG Thr
CCA Pro    UCA Ser    GCA Ala    ACA Thr
CUC Leu    UUC Phe    GUC Val    AUC Ile
CUU Leu    UUU Phe    GUU Val    AUU Ile
CUG Leu    UUG Leu    GUG Val    AUG Met
CUA Leu    UUA Leu    GUA Val    AUA Ile
CGC Arg    UGC Cys    GGC Gly    AGC Ser
CGU Arg    UGU Cys    GGU Gly    AGU Ser
CGG Arg    UGG Trp    GGG Gly    AGG Arg
CGA Arg    UGA Ter    GGA Gly    AGA Arg
CAC His    UAC Tyr    GAC Asp    AAC Asn
CAU His    UAU Tyr    GAU Asp    AAU Asn
CAG Gln    UAG Ter    GAG Glu    AAG Lys
CAA Gln    UAA Ter    GAA Glu    AAA Lys

Table 2: The eukaryotic code.

codon  a.a.      J_H   J_V   J_H,3  J_V,3     codon  a.a.      J_H   J_V   J_H,3  J_V,3
CCC    Pro       3/2   3/2    3/2    3/2      GCC    Ala       3/2   3/2    3/2    1/2
CCU    Pro       1/2   3/2    1/2    3/2      GCU    Ala       1/2   3/2    1/2    1/2
CCG    Pro       3/2   1/2    3/2    1/2      GCG    Ala       3/2   1/2    3/2   -1/2
CCA    Pro       1/2   1/2    1/2    1/2      GCA    Ala       1/2   1/2    1/2   -1/2
CUC    Leu       1/2   3/2    1/2    3/2      GUC    Val       1/2   3/2    1/2    1/2
CUU    Leu       1/2   3/2   -1/2    3/2      GUU    Val       1/2   3/2   -1/2    1/2
CUG    Leu       1/2   1/2    1/2    1/2      GUG    Val       1/2   1/2    1/2   -1/2
CUA    Leu       1/2   1/2   -1/2    1/2      GUA    Val       1/2   1/2   -1/2   -1/2
CGC    Arg       3/2   1/2    3/2    1/2      GGC    Gly       3/2   3/2    3/2   -1/2
CGU    Arg       1/2   1/2    1/2    1/2      GGU    Gly       1/2   3/2    1/2   -1/2
CGG    Arg       3/2   1/2    3/2   -1/2      GGG    Gly       3/2   3/2    3/2   -3/2
CGA    Arg       1/2   1/2    1/2   -1/2      GGA    Gly       1/2   3/2    1/2   -3/2
CAC    His       1/2   1/2    1/2    1/2      GAC    Asp       1/2   3/2    1/2   -1/2
CAU    His       1/2   1/2   -1/2    1/2      GAU    Asp       1/2   3/2   -1/2   -1/2
CAG    Gln       1/2   1/2    1/2   -1/2      GAG    Glu       1/2   3/2    1/2   -3/2
CAA    Gln       1/2   1/2   -1/2   -1/2      GAA    Glu       1/2   3/2   -1/2   -3/2
UCC    Ser       3/2   3/2    1/2    3/2      ACC    Thr       3/2   3/2    1/2    1/2
UCU    Ser       1/2   3/2   -1/2    3/2      ACU    Thr       1/2   3/2   -1/2    1/2
UCG    Ser       3/2   1/2    1/2    1/2      ACG    Thr       3/2   1/2    1/2   -1/2
UCA    Ser       1/2   1/2   -1/2    1/2      ACA    Thr       1/2   1/2   -1/2   -1/2
UUC    Phe       3/2   3/2   -1/2    3/2      AUC    Ile       3/2   3/2   -1/2    1/2
UUU    Phe       3/2   3/2   -3/2    3/2      AUU    Ile       3/2   3/2   -3/2    1/2
UUG    Leu       3/2   1/2   -1/2    1/2      AUG    Met       3/2   1/2   -1/2   -1/2
UUA    Leu       3/2   1/2   -3/2    1/2      AUA    Met/Ile   3/2   1/2   -3/2   -1/2
UGC    Cys       3/2   1/2    1/2    1/2      AGC    Ser       3/2   3/2    1/2   -1/2
UGU    Cys       1/2   1/2   -1/2    1/2      AGU    Ser       1/2   3/2   -1/2   -1/2
UGG    Trp       3/2   1/2    1/2   -1/2      AGG    Ter/Arg   3/2   3/2    1/2   -3/2
UGA    Trp/Ter   1/2   1/2   -1/2   -1/2      AGA    Ter/Arg   1/2   3/2   -1/2   -3/2
UAC    Tyr       3/2   1/2   -1/2    1/2      AAC    Asn       3/2   3/2   -1/2   -1/2
UAU    Tyr       3/2   1/2   -3/2    1/2      AAU    Asn       3/2   3/2   -3/2   -1/2
UAG    Ter       3/2   1/2   -1/2   -1/2      AAG    Lys       3/2   3/2   -1/2   -3/2
UAA    Ter       3/2   1/2   -3/2   -1/2      AAA    Lys       3/2   3/2   -3/2   -3/2

Table 3: Values of the total spin $J_\alpha$ and of its third component $J_{\alpha,3}$ for
$su(2)_\alpha$ ($\alpha = H, V$).
(In the "amino-acid" (a.a.) columns, left is for the VMC and right for the EC.)

dinucl.   J^d_{H+} J^d_{H-}   J^d_{H-} J^d_{H+}   J^d_{V+} J^d_{V-}   J^d_{V-} J^d_{V+}
CC               1                   0                   1                   0
CU               0                   0                   1                   0
CG               1                   0                   0                   0
CA               0                   0                   0                   0
UC               1                   1                   1                   0
UU               0                   1                   1                   0
UG               1                   1                   0                   0
UA               0                   1                   0                   0
GC               1                   0                   1                   1
GU               0                   0                   1                   1
GG               1                   0                   0                   1
GA               0                   0                   0                   1
AC               1                   1                   1                   1
AU               0                   1                   1                   1
AG               1                   1                   0                   1
AA               0                   1                   0                   1

Table 4: Values of $J^d_{\alpha,\pm} J^d_{\alpha,\mp}$ ($\alpha = H, V$) for the dinucleotides formed by the first two nucleotides.